Zero-Shot Learning Shakes Up 'Segment Anything'! SAMURAI Breaks Through Video Tracking Bottlenecks, Locking onto Targets in Real Time Effortlessly!
Meta's 'Segment Anything' model (SAM) has been a force to be reckoned with in image segmentation, but it struggles with video object tracking, especially in crowded scenes, with fast-moving targets, or when objects disappear behind occlusions and reappear. The cause is SAM's memory mechanism, which acts like a 'fixed window': it keeps only the most recent frames and ignores the quality of what it stores, so errors propagate from frame to frame and tracking performance drops significantly. To address this issue, the University of Washington's...